Neural Generation of Regular Expressions from Natural Language with Minimal Domain Knowledge
نویسندگان
چکیده
This paper explores the task of translating natural language queries into regular expressions which embody their meaning. In contrast to prior work, the proposed neural model does not utilize domain-specific crafting, learning to translate directly from a parallel corpus. To fully explore the potential of neural models, we propose a methodology for collecting a large corpus1 of regular expression, natural language pairs. Our resulting model achieves a performance gain of 19.6% over previous state-of-the-art models.
منابع مشابه
Using XML and Regular Expressions in the Syntactic Analysis of Inflectional Language
In this paper we describe an approach to representation of data and knowledge using two technologies: XML and regular expressions in a domain of natural language syntactic analysis. Analysis of text written in natural language requires several lexicons that aid the process of syntactic analysis. Moreover knowledge about the language (e.g., syntactic rules) should be represented and interpreted....
متن کاملTowards modeling the semantics of calendar expressions as extended regular expressions
This paper proposes modeling the semantics of natural-language calendar expressions as extended regular expressions (XREs). The approach covers expressions ranging from plain dates to such ones as the second Tuesday following Easter. The paper presents basic calendar XRE constructs, sample calendar expressions with their representations as XREs, and possible applications in reasoning and natura...
متن کاملGenerating Anaphoric Expressions: Pronoun Or Definite Description?
In order to produce coherent text. natural language generation systems must have the ability to generate pronouns in the appropriate places. In the past, pronoun usage was primarily investigated with respect to the accessibility of referents. We.argue that generating appropriate referring expressions requires looking at factors beyond accessibility. Also important are sentence boundaries, dista...
متن کاملسیستم شناسایی و طبقهبندی موجودیتهای اسمی در متون زبان فارسی بر پایه شبکه عصبی
Named Entity Recognition (NER) is a fundamental task in natural language processing and also known as a subset of information extraction. We seek to locate and classify named entities in text into predefined categories such as the names of persons, organizations, locations, expressions of times, etc. Named Entity Recognition for English texts has been researched widely for the past years, howev...
متن کاملOntologies as a Source for the Automatic Generation of Grammars for Information Extraction Systems
Grammars for Natural Language Processing (NLP) applications are generally built either by linguists – on the basis of their language competence, or by automated tools applied to existing large corpora of language data — using either supervised or unsupervised methods (or a combination of both). Domain knowledge usually played just a little role in this process. The increasing availability of ex...
متن کامل